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Abstract — A new perspective on cognitive radio is presented, 
where the pre-existent legacy service is either uncoded or coded 
and a pair of cognitive transceivers need be appropriately 
deployed to coexist with the legacy service. The basic idea un- 
derlying the new perspective is to exploit the fact that, typically, 
the legacy channel is not fully loaded by the legacy service, thus 
leaving a non-negligible margin to accommodate the cognitive 
transmission. The exploitation of such a load margin is optimized 
by shaping the spectrum of the transmitted cognitive signal. 
It is shown that non-trivial coexistence of legacy and cognitive 
systems is possible even without sharing the legacy message with 
the cognitive transmitter. Surprisingly, the optimized cognitive 
transmitter is no longer limited by its interference power at 
the legacy receiver, and can always transmit at its full available 
device power. Analytical development and numerical illustration 
are presented, in particular focusing on the logarithmic growth 
rate, i.e., the prelog coefficient, of cognitive transmission in the 
high-power regime. 

Introduction and Overview 

Cognitive radio is envisioned as a means of improving 
utilization of scarce spectral resources, and has been attracting 
increasing attention from both scholars and practitioners (see, 
e -g-i PL [2], [3], and papers archived in [4], [5], [6] for an 
incomplete list of topics and results). Loosely speaking, in 
cognitive radio systems, unlicensed "cognitive" users poten- 
tially "sense" the spectrum utilization and transmit in such a 
manner as to cause no "harmful" interference for legacy users 
who have been licensed to use the spectrum. 

In the prevailing approach to cognitive radio, cognitive 
transmitters detect the presence of legacy service and if there 
is none, they transmit thus utilizing the "white space". The 
key challenge there has been that the legacy service detection 
problem (a.k.a. "spectrum sensing") can be extremely diffi- 
cult in the low-power regime [3], especially in a dynamic 
environment where the system parameters exhibit a certain 
amount of uncertainty. In fact, as theoretically shown and 
numerically validated in [3], there normally exists a threshold 
(called the SNR wall) such that, if the legacy signal power falls 
below that threshold, then reliable detection becomes impos- 
sible even if an infinitely long observation is made available. 
Even without such considerations, this conceptually simple 
approach of filling spectral holes (also called "white space" by 
some practitioners) in nature limits cognitive transmission to 



spectrally sparse envorinments, and inevitably achieves limited 
throughput in spectrally "dense" (e.g. urban) areas. 

Primarily motivated by the difficulties and limitations of 
the prior approach, there has been increased attention to- 
wards overlay cognitive radio systems, in which cognitive 
transmission coexists with legacy transmission through proper 
interference management. From a theoretical point of view, 
as long as the legacy service does not communicate at its 
channel capacity, i.e., the legacy channel is not "fully loaded", 
it is able to tolerate a certain amount of interference from 
the cognitive users. In practice, many legacy systems are 
broadcast systems with many potential subscribers located 
within a large geographic area, for example, analog/digital 
television, terrestrial/satellite radio, and cellular downlink. 
Since a broadcaster needs to provide a uniform quality of 
service over a long distance, for those receivers not close 
to the edge of service area, there exist abundant signal-to- 
noise (SNR) margins beyond the minimum required SNR for 
successful reception. 

Therefore, by imposing a fixed quality of service measure 
on legacy receivers, we can proceed to design the signal 
of cognitive transmission subject to that measure. In this 
paper, we take the quality of service measure as the mean- 
squared estimation error (MSE) of the legacy signal, in the 
case of uncoded legacy transmission, and as the achievable 
information rate of encoding the legacy message, in the case 
of coded legacy transmission. In the overlay cognitive radio 
approach, it is not the absence of, but rather the existence 
of, legacy transmission that we seek to exploit. Departing 
from this basic idea, the most natural design philosophy 
perhaps is that, a cognitive transmitter is constrained in a way 
such that its power, when arriving at the legacy receiver as 
interference, does not exceed a certain prescribed threshold. 
Consequently, a class of optimization problems that seek 
to maximize the cognitive achievable rate subject to legacy 
received power constraints can be formulated and solved; see 
[7] and subsequent works. 

The aforementioned approach based upon regulating legacy 
received power essentially treats the interference power itself 
as an indicator of the interference level. This is consistent 
with the current FCC regulatory policy that measures the 
interference in terms of its power and calls the power level the 



"interference temperature" [8]. A closer look at the cognitive 
transmission problem, however, suggests that it is not merely 
the power level itself, but the power spectral density (PSD), 
that dictates the achievable performance of a legacy receiver. 
This observation is the key idea underlying the development 
in our paper. 

Starting from the idea of optimizing the PSD of cognitive 
signal (called "spectrum shaping" in this paper), we treat two 
representative cases: (1) where the legacy service does not 
encode its message and thus transmits an uncoded signal, and 
(2) where the legacy message is digitally coded. To keep the 
development more accessible while conveying the essential 
conclusions, in this paper we only consider a single pair 
of cognitive transceivers (the problem of spectrum sharing 
among multiple pairs of cognitive transceivers is left for future 
investigation). 

Our main finding is that, surprisingly, through appropriately 
shaping the cognitive signal spectrum coupled with optimal 
filtering (for the case of uncoded legacy transmission) or 
decoding (for the case of coded legacy transmission), cognitive 
transmission can actually overcome the interference limitation, 
and can always transmit at its full available device power. 
This is in contrast to the aforementioned design principle 
based upon regulating legacy received power, because there the 
cognitive transmitter is forbidden to keep ramping its power 
level when hitting a certain upper limit, which is usually much 
smaller than the available device power especially when the 
distance between the cognitive transceivers is short, a typical 
scenario in proposed cognitive radio applications. 

We present analytical development and numerical illustra- 
tion for our results, in particular focusing on the logarithmic 
growth rate, i.e., the prelog coefficient, of cognitive transmis- 
sion in the high-power regime. As suggested by the previous 
discussion, this regime is the most relevant for applications, 
and also demonstrates the most of the advantages of the spec- 
trum shaping approach. Briefly speaking, our results show that 
it is always possible to achieve logarithmic growth of cognitive 
transmission rates whenever the legacy channel is not fully 
loaded. The degree of legacy channel loading, nevertheless, 
affects the logarithmic growth rate, - the heavier (lighter) 
the legacy channel is loaded, the larger (smaller) the prelog 
coefficient the cognitive transmission is allowed to achieve. 

Before entering the detailed exposition of the paper, we 
briefly remark on the connection of our development to some 
apparently related work. As will be evidenced in this paper, 
the spectrum shaping approach typically leads to a subset 
of spectral bands that are "turned on" while leaving the 
remaining spectral bands "off". This appears to be analogous 
to frequency-division multiple access (FDMA), which has 
been shown in [9] to bear certain optimal properties in spec- 
trum sharing among multiple cognitive systems for unlicensed 
bands (i.e., without a legacy system). We note that, in our 
model, we are concerned with the coexistence between legacy 
and cognitive systems, and their sharing is not FDMA, because 
the legacy signal is assumed to occupy the entire spectrum. 
The "on-off" behavior of cognitive signal PSD is the more a 



consequence of non-convex optimization. 

Another proposed approach with a strong information- 
theoretic flavor is based on the technique of "dirty paper 
coding" [10], as established in [11], [12], A key assumption 
there is that the cognitive transmitter has access to the message 
(even not only the codebook) of the legacy transmitter. With 
this additional knowledge, the cognitive transmitter thus can 
adapt its signal using dirty paper coding so as to compensate 
for the excess interference it causes to the legacy receiver, and 
the overall effect is that the legacy transmission is not affected 
by the cognitive transmission. Similar to the spectrum sharing 
approach, this approach also achieves logarithmic growth of 
cognitive transmission rates, and furthermore does not reduce 
its prelog coefficient. As such, a distributed implementation 
of dirty paper coding has been instrumental for such results. 
Disregarding the attendant complexity and non-ideal effects in 
implementing such schemes in practice (see, e.g., [13]), there 
remain a few more pressing concerns, such as scalability, pri- 
vacy, and security - thus at present, legacy message sharing is 
unlikely feasible due to technological and political constraints. 
In contrast, the spectrum sharing approach presented herein 
may be implemented using off-the-shelf technologies. 

The remainder of the paper is organized as follows. The 
main body is divided into two parts, one (Part I) for the 
case of uncoded legacy transmission, and the other (Part II) 
for the coded case. Part I contains five sections. Section 
U introduces the basic system model, which is special as 
the legacy and cognitive receivers are collocated (whereas 
separately processed), - an assumption adopted to minimize 
technical hurdles of analysis. Section [TT] gives the performance 
analysis when the legacy receiver ignores the inherent cor- 
relation structure in signals, i.e., when taking an interference- 
temperature approach. The observation there is that the achiev- 
able cognitive rate is ultimately limited by the barrier posed 
by its interference to legacy users. The main section, Section 
Hill formulates the spectrum shaping problem, discusses its 
main properties, and gives necessary conditions for achieving 
its optimal solution. Special attention is paid to the on-off 
cognitive PSD, and the resulting prelog coefficient is analyzed. 
Section [lV]presents two case studies to illustrate the analytical 
results, and numerical results show that the spectrum shaping 
approach leads to substantial rate gains compared with the 
interference-temperature approach. Lastly in this part, Section 
[V]discusses the more general situation where there are multiple 
legacy receivers and the cognitive receiver is not collocated 
with them. 

The treatment for the coded of coded legacy transmission is 
in Part II, which contains three sections. S ection IVT1 introduces 
the basic systems model, consisting of a pair of cognitive 
transceivers in the presence of a pair of legacy transceivers, 
with each transceiver being equipped with a single antenna. 
Section lvni studies various possible system behaviors in detail, 
and establishes that the cognitive rate prelog coefficient takes 
a particularly simple form, as one minus the normalized load 
of the legacy channel. Such a result thus evidently reveals the 
tradeoff between loading the legacy system and increasing the 



rate of cognitive transmission. Section lYUH further extends the 
above result to a more general situation, where the cognitive 
transceivers are equipped with multiple antennas, and shows 
that increasing the number of antennas scales the prelog 
coefficient, thus compensating for the rate loss due to legacy 
transmission. 

Finally, a section of concluding remarks concludes the 
paper, remarking on the system models used in the paper, and 
discussing some implementation issues. 

Part I: Uncoded Legacy Transmission 

I. Basic System Model: Collocated Receivers 

In this section, we introduce a simplified channel model, 
in which the cognitive receiver and the legacy receiver are 
collocated such that they receive the common signal, as 



Y„ = x /iS„+X„ + N„, 



(1) 



where the subscript n denotes sampled discrete time. The 
legacy signal S n is from a typically remote transmitter of some 
legacy service, for example, television or radio broadcasting, 
and is in uncoded analog form. The deterministic channel 
gain a models the attenuation for the legacy signal at the 
receiver, - it is assumed known at the receiver. The noise 
process N„ is comprised of thermal noise, unmodeled external 
random disturbance, and interference from other devices not 
considered in the system model. The cognitive transmitter 
sends digitally coded signals, with the resultant received 
signal, X„. A key observation we make in the paper is that 
the legacy channel gain a typically has a large dynamic range 
(tens of dBs), thus leading to a large SNR margin that will 
be exploited by the cognitive user(s). In contrast, the cognitive 
transmitter is of interest only to its corresponding receiver, and 
we consequently normalize the cognitive signal's attenuation 
to unity, for simplicity and without loss of generality in the 
collocated receiver model. 

We note that, although the model (Q]) describes the received 
signal at both a legacy receiver and a cognitive receiver, the 
two receivers have distinct goals. The legacy receiver simply 
wishes to reconstruct its legacy signal and has no knowledge 
of the cognitive user save its statistics (assumed to be passively 
identified via channel outputs). The cognitive receiver wishes 
to decode its message possibly exploiting statistical knowledge 
of the legacy signal and channel. Throughout the paper, we do 
not allow the cognitive transmitter to obtain the message of 
the legacy service provider, and deem such message-sharing 
as technologically and politically infeasible. 

We further model the legacy signal S„ as a time-correlated 
stochastic process, taking continuous values on the complex 
plane. Specifically, we assume that S„ is a wide-sense station- 
ary (WSS) zero-mean circular-symmetric complex Gaussian 
(ZMCSCG) process with PSD 0s (w), for -tt < uj < tt, which 
is known as statistical knowledge at both the legacy receiver 
and the cognitive receiver. The noise N n is also modeled as 
a WSS ZMCSCG process with a known PSD (j) N (uj). 

The legacy receiver does not possess any knowledge of the 
codebook of the cognitive signal X„, except certain statistics 
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Fig. 1. Channel model of cognitive radio with uncoded legacy service and 
collocated receivers. 



like the PSD, and thus can only estimate S n by treating 
(X ra + N„) as some form of noise. Furthermore, the legacy 
service requires that the estimate of S ra satisfies a prescribed 
distortion constraint. We take the distortion measure as the 
MSE in the paper. Meanwhile, the cognitive receiver seeks to 
reliably decode its signal X„, with a vanishing message error 
probability for asymptotically large coding block length. Since 
the legacy signal S„ is uncoded, the best the cognitive receiver 
can do is to treat it as a time-correlated Gaussian noise, in 
addition to the actual system noise N„. Figure Q] illustrates 
the channel model. 

Ignoring the legacy service's performance, the optimal input 
X„ for maximizing the cognitive transmission rate is a WSS 
ZMCSCG process, with its PSD 4>x(u) chosen to solve a 
water-filling problem for (Q]). Given the legacy service and its 
distortion constraint, however, this water-filling PSD solution 
may no longer be optimal. Furthermore, if the legacy receiver 
takes into account the full statistics of X„ in estimating S„, 
then the optimal distribution of X„ may not even be Gaussian. 
In this paper, for tractability and practicality, we nevertheless 
restrict the legacy receiver to perform linear estimation and the 
cognitive transmitter to transmit WSS signal. Thus, estimation 
MSE depends only on the PSD of S„,X„, and N n (see, e.g., 
[14]). Consequently, there is no loss in optimality to take X„ 
to be a WSS ZMCSCG process, and the problem reduces to 
designing its PSD <f>x(co). 

II. An Interference-Temperature Perspective: 
Memoryless Estimation 

In this section, we consider a suboptimal estimation pro- 
cedure, in which the legacy receiver performs a symbol- 
by-symbol minimum mean-squared error (MMSE) estimation 
of S„, without taking into account the temporal correlation 
structure of signals. Let us denote the variances of S„ , X„ and 
N„ be erf, a\, and cr^, respectively. The MSE achieved by 



memoryless MMSE estimation of S„ is 

MSE 



(2) 



Consequently, if the legacy receiver poses an MSE distortion 
constraint D upon its estimation of S n , we have that the 
cognitive signal PSD should satisfy 



2 1 r , , x , °% d 

J x = 7T 0x(w)dw < - 
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(3) 



In the presence of an additional power constraint P for 
Xn, (0 is simply another legacy-induced power constraint. 
Therefore the cognitive rate maximization problem is reduced 
to the following standard form, 



max 
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du, (4) 



(5) 



It readily follows that the standard water-filling strategy solves 
the problem, and the optimal solution is 



</>x,oM = 



(6) 



where [•]+ denotes max{-,0}, and A > is chosen such that 
5? fin <fa.fl(u)dtji) = P is satisfied. 

For memoryless MMSE, we emphasize that the achievable 
rate of the cognitive transmission is ultimately limited by 
the legacy channel gain a. For cognitive radio applications, 
the typically short-range channel for X„ makes the power 
constraint P usually large. Due to the upper limit imposed by 
a, however, the maximum power of the cognitive transmission 
is severely regulated by Pq, and thus the cognitive rate R c 
behaves like 0(1) as P — > oo, instead of O(logP) without 
the legacy MSE distortion constraint. 

Finally, we note that, the cognitive rate maximization prob- 
lem is feasible only under the conditions of 



D < 



4 



and D > D := 



(7) 
(8) 



The first condition (O is obvious, since even without any 
channel output, the MSE distortion in estimating S„ is at most 
<7g. In other words, if D > <r|, the cognitive transmission 
is no longer constrained by the legacy service, and thus 
standard water-filling can be utilized with <j\ = P. The 
second condition ^ poses a limit upon the minimum legacy 
channel gain a. That is, if the receiver is located such that a 
is too small, then the distortion target D cannot be satisfied 
even without any cognitive transmission, and the problem (0]i 
becomes infeasible. 



III. Optimal Legacy Receiver Processing: 
Non-Causal Wiener-Kolmogorov Filtering 

Given a sufficiently long block of channel outputs, the 
MMSE for estimating the legacy signal is achieved by the 
non-causal Wiener-Kolmogorov filter. Therefore as the obser- 
vation interval grows long, the achievable MSE asymptotically 
approaches [14] 
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2tt J_ 7r a<j>s (to) + 4>x M + 0n M 
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2tt J_ w cf> x (w) + a(f>s (w) + 4> N (w) 
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Thus for a prescribed MSE distortion constraint D, we can 
pose the cognitive rate maximization problem as follows. 



max 7T~ / log H , , . ; — i—. 



dw, (10) 
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(12) 



That is, we maximize the information rate of the cognitive 
transmission, under the constraints that the estimation MSE 
of the legacy signal does not exceed a prescribed distortion 
D, and that the power of the cognitive signal does not exceed 
a maximally allowed value P. 

A. The Power Constraint is Always Active 

A surprising observation upon inspecting the cognitive rate 
maximization problem is that, if there is no power constraint 
(Q/Ji with P < oo and if the MSE constraint (fTTb leads to a 
strictly nonempty feasible set for (j>x(u>), then the cognitive 
rate in dTOb can be divergent, i.e., infinitely large. This obser- 
vation can be established as follows; see also Figure |2] If we 
let <fix(u) — * oo over a pair of narrow intervals of frequencies 
< LO\ < | a; | < u) u < 7r, and let (px(uj) — elsewhere, then 
the resultant cognitive rate in ( TTOb diverges. On the other hand, 
the MSE constraint (fTTT > now can be rewritten as 
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Ztt 



Ju] a4>s (w) + 4>n (w) 
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which can be satisfied by choosing the volume of the interval 
[wj,a; u ] sufficiently small but strictly positive, as long as we 
have 



D> D := — 
— 2?r 



n a0 s (w) + ^n(w) 
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(13) 



In fact, a closer inspection of D_ reveals that it is exactly the 
MMSE achieved by non-causal Wiener-Kolmogorov filtering 
when we choose <fix(u) = 0, — it < oj < it. So (TT3T > 
always holds true if the MSE constraint ( fTTl i leads to a strictly 
nonempty feasible set for 4>x(oj). Otherwise, when we have 
D = D, the only feasible choice of <f>x{ui) is 4>x{tS) = 




Fig. 2. Illustration of <px(^) that leads to divergent cognitive rates. 
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Fig. 3. Illustration of the non-convexity of the feasible set of the cognitive 
PSD. 



for all — 7r < u> < tt (possibly except for a subset of measure 
zero), and consequently the achievable cognitive rate vanishes. 

The observation that </>x(w) can lead to divergent cognitive 
rates while still satisfying the MSE distortion constraint im- 
mediately leads to two useful conclusions. First, the power 
constraint ( TTSl i must always hold with equality and thus is 
always active, for arbitrarily large P. In other words, there 
should not be any legacy-induced barrier that regulates the 
cognitive transmission power as in the memoryless estimation 
procedure described in Section [XT] This conclusion is surpris- 
ing, because as we commented in the introduction, without 
message-sharing, all the existing cognitive radio schemes are 
power-constrained so as not to exceed a certain externally 
imposed threshold for its interference to legacy receivers. 
Here, however, our observation implies that such a constraint 
is actually unnecessary and that the cognitive transmitter can 
indeed transmit with its full power. The second conclusion 
led by our observation is that, the cognitive rate maximization 
problem is generally non-convex, in the presence of the 
MSE distortion constraint. Intuitively, the feasible set of the 
cognitive PSD has many "spikes" that correspond to the peaky 
PSD as described in this subsection; see Figure [3] for an 
illustration. The argument in our observation implies that it is 
often beneficial for the cognitive transmitter to operate close 
to those "spikes". 



B. General Necessary Condition for Optimal Cognitive PSD 

Depending upon whether the MSE distortion constraint ( fTTb 
is active, the cognitive rate maximization problem exhibits two 
distinct cases, as elaborated in this subsection. 

In the first case , the power constraint P is small, so that the 
water-filling solution of 4>x(w) with respect to (fT2l > does not 
violate the MSE distortion constraint (fTTT >. The water-filling 
solution, 
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(14) 
(15) 



is the optimal PSD for the cognitive transmission. For this 
case to arise, we need to have 
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2^ 



a<f>\ (oj) 



4>x,wi M + a<j>s (w) + 4>n (u>) 



doj>aj- D, (16) 



where 0x,wf is given by ( TBI . 

In the second case , the inequality condition ( fT6l ) is violated, 
so it follows that both ( fTTb and ( fT2b constraints become active. 
The optimization problem is non-convex, since the feasible set 
of (f>x(w) satisfying ( TTTb is a non-convex set, as suggested by 
the previous subsection. In the following, we present a general 
necessary condition for optimality in this case. 

Proposition 1: For the cognitive rate maximization problem 
( fTOb . in its second case where both the MSE distortion con- 
straint and the power constraint are active, the optimal solution 
of the cognitive PSD (f>x(w) should satisfy the following 
condition: for every \u>\ < tt, either (f>x(u) is a boundary point 



&cM = o, 

or 4>x(u) is a locally maximum point 



(17) 



<j>x{w) = 



y/1 + 4\fia<j>l(uj) + 1 
^2p, 



a0s(w) - 0nM, (18) 



for some /i < and A such that (a) the inequality 

1 + AXfia(j)l(uj) > 



(19) 



holds, and (b) </>x(w) as given by (fT8l is positive. 

Due to the lack of convexity in the optimization problem, 
in general, little can be said beyond Proposition [TJ above. The 
solution of the cognitive rate maximization problem in the 
second case can in principle be obtained through an infinite- 
dimensional search procedure, outlined as follows. Consider an 
arbitrarily chosen measurable subset 3 C [0, tt]. Let 4>x(u) = 
for every \u\ ^ 3. Determine Lagrange multipliers [i < and A 
to generate <f>x(w) according to ( fT8l . for every \ui\ G 3, such 
that both the MSE distortion constraint (TTTb and the power 
constraint ( fT2l i are satisfied with equality. Denote the resulting 
achievable rate calculated following ( fTOb by then the 

maximum cognitive rate is sup R(3) with the supremum taken 
over all measurable subsets of [Q,tt]. 



C. A Suboptimal Solution: On-Off Cognitive PSD 

The fact that the power constraint (TT~2T > is always active 
implies that the cognitive transmitter can always transmit with 
its full power, without introducing excess interference to the 
legacy service, through appropriately tailoring its signaling 
PSD. As the power budget P — > oo, the cognitive rate R c 
behaves like O(logP), which is in fundamental contrast to 
the 0(1) behavior in the memoryless estimation scenario as 
described in Section [TT] 

A lower bound to the prelog coefficient for large P can 
be obtained by considering a specific class of cognitive PSD, 
namely the on-off PSD for which 0x(w) equals either or 
some 0o > 0. We optimize the prelog coefficient lower bound 
over all possible supporting sets of 0o, and the optimized lower 
bound is given by the following proposition. 

Proposition 2: For the cognitive rate maximization problem 
dTol l. as the power budget P — > oo, the on-off cognitive signal 
PSD that maximizes the prelog coefficient of the cognitive rate 
is determined by a parameter 7 > satisfying 
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a0 s (w) + N M 



D-D, 



(20) 



where [■] denotes min{-,7} and D is given by (fT~3l >. The 
corresponding on-off cognitive signal PSD is then 



0xM = 0o • sgn • 



a0| (to) 



a4>s (w) + 0n M 



(21) 



where sgn{ } denotes the sign of its operand, the on-level 0o 
is chosen to satisfy the power constraint (fT2l . and the prelog 
coefficient is the volume of the supporting set 
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a0s(w) + 4>n (w) 



>o 



divided by tt. 

As implied by d20l l, the optimal choice of on-off PSD 
<fix(u) should "fill" the spectral bands in which the "pre- 
emphasized" PSD, a0|(w)/[a0s(w) + 0n(w)], is small, until 
the area (normalized by 2ir) under that PSD equals (D — D_), 
the gap between the prescribed MSE constraint and the MMSE 
without cognitive transmission. 

IV. Case Studies 

In order to gain insights from the analysis of memoryless 
estimation and non-causal Wiener-Kolmogorov estimation, we 
present two case studies in this section. For both case studies, 
the noise process N„ is memoryless with 0n (u>) = cr^, — tt < 
uj < tt. The legacy signal S„ is taken as a memoryless process 
for the first case study, and as a first-order autoregressive (AR) 
process for the second. We note that, although singled out for 
its particular simplicity, the memoryless process is indeed a 
special first-order AR process with innovation rate unity. 



A. Memoryless Legacy Signal Process 

In this case, we have 0s (w) = cr§, —tt < u> < tt. The 
analysis in Section [TT] for memoryless estimation immediately 
leads to 



</»x,oM = Po= min \ P, 



and 



Rc = log 1 
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-tt < W < TT, (22) 



(23) 



For non-causal Wiener-Kolmogorov estimation, the first 
case for which water-filling with respect to the power con- 



straint is optimal corresponds to P < 



oiD 



So 



in the first case, the cognitive rate for non-causal Wiener- 
Kolmogorov estimation coincides with that for memoryless 
estimation. Beyond that threshold power, we enter the second 
case as elaborated in Section [171] and the spectrally flat 0s (w) 
and 0n (u>) make the non-convex optimization problem easy to 
solve. In particular, it readily follows that the on-off cognitive 
PSD, 



0x(w) = 00 



aajP 



(D-D)(c 



— atjc — a 



N I 



M < 



ttP 



and 0x(w) = elsewhere, is an optimal solution, 
maximum cognitive rate thus is 
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(24) 



In Figure [U we plot the cognitive rates as a function of power 
P for memoryless MMSE and non-causal Wiener-Kolmogorov 
filtering legacy receivers. Comparing them, it is immediately 
seen that with the non-causal Wiener-Kolmogorov filter at the 
legacy receiver, the cognitive transmitter is able to achieve 
logarithmic growth in its rate; while with the memoryless 
MMSE estimation at the legacy receiver, the cognitive rate 
always dwells around small values due to the legacy-induced 
barrier upon the cognitive transmission power. 

By noting that D_ — -^V, the asymptotic prelog coeffi- 
cient of R c as P — > 00 can further be rewritten as 



I J li- I . — or _ . . — . j I 

' 9 1 9 2 ' 5 2 21 

ao-iJ cr| acr| <j z s acr| V "S. 

We observe that, two factors affect the large-P growth of 
R c . First, Dja\ quantifies the quality requirement on esti- 
mating S„. The larger D jo\ is, the coarser the legacy service 
quality we can tolerate, while the higher the cognitive rate 
we can achieve. Second, acr|/(T^ quantifies the strength of 
the received legacy signal. For larger aa\/a^, the cognitive 
transmitter can exploit more spectral bands for its transmission 
and thus can achieve higher rates. In the limiting case, as 
acr s/ cr N ~~ * 00 ■> tne P re l°g coefficient approaches Dja\. 
The behavior of the prelog coefficient versus both Dja\ and 
aa\ /ct^j is illustrated in the mesh plot of Figure [5] We also 
note that, if aa^/a^ is too small, then the cognitive rate 
maximization problem becomes infeasible, and the resulting 
prelog coefficient vanishes. 
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Fig. 4. Illustration of the cognitive rates as a function of power P 
for memoryless MMSE and non-causal Wiener-Kolmogorov filtering legacy 
receivers, in the case where the legacy signal process is memoryless. System 
parameters are a% = dB, <r^ = dB, D = -20 dB, and a = 30 dB. 




Fig. 6. Illustration of the cognitive rate prelog coefficient as P — * oo versus 
both D/a^ and acr|/<r^,, for t = 0.1 in the case where the legacy signal 
process is a first-order AR process. 




Fig. 5. Illustration of the cognitive rate prelog coefficient as P — » oo versus 
both -D/oJ and aoj /it?, , in the case where the legacy signal process is 
memoryless. 
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Fig. 7. Illustration of the cognitive rates as a function of power P 
for memoryless MMSE and non-causal Wiener-Kolmogorov filtering legacy 
receivers, in the case where the legacy signal process is a first-order AR 
process. System parameters are oj = dB, <xj^ = dB, D = —20 dB, 
a = 30 dB, and e = 0.1. 



B. First-Order AR Legacy Signal Process 
In this case, S„ evolves as 



(25) 



where the innovation rate < e < 1 characterizes the memory 
in process, and {W„}^L are a sequence of independent, 
identically distributed (i.i.d.) ZMCSCG random variables with 
variance normalized to <r§. Therefore the process S„ has PSD 



0s(w) 



(2-e)-2VT 



-7T < UJ < 7T. 



e cos cl> 



(26) 



We then examine the prelog coefficient for on-off cognitive 
PSD, following Section IIII-CI and plot in Figure [6] the mesh 
plot of the prelog coefficient versus both D/a\ and aa\ / <7^ , 
for e = 0.1, a moderate value of signal correlation. Compared 
with Figure [5] which plots the case for memoryless legacy 
signal processes, we notice that legacy signal correlation 
significantly improves the achievable rate of cognitive trans- 
mission. 



In Figure [7] we plot the cognitive rates as a function of 
power P for memoryless MMSE and non-causal Wiener- 
Kolmogorov filtering legacy receivers. Observations similar to 
that for Figure |4] can be drawn by comparing the rates, and it 
can also be seen that the achievable rates substantially increase 
with increased legacy signal correlation. 

V. On-Off Cognitive PSD for General 
Legacy-Cognitive Topology 

In this section, we briefly present the idea of how we may 
extend the insights from the basic system model of collocated 
receivers to the more general system model in which there 
are multiple legacy receivers not collocated with the cognitive 
receiver. This situation is as illustrated in Figure [8] We assume 
that there are K legacy receivers, each of which has channel 
gain a,k from the legacy transmitter and channel gain from 
the cognitive transmitter, for k = 1, . . . , K. We also denote 
the channel gain from the legacy transmitter to the cognitive 
receiver by oo, and the channel gain from the cognitive 
transmitter to the cognitive receiver by go . For the K legacy 
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Fig. 8. Illustration of a general cognitive radio system where there are 
multiple legacy receivers to be served and the cognitive receiver is not 
collocated with the legacy receivers. 
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Fig. 9. Illustration of the basic channel model for cognitive transmission in 
the presence of coded legacy transmission. 



receivers, each has its own MSE distortion constraint, indexed 
as Dk, for k — 1, .... K. More general scenarios of multiple 
legacy transmitters and multiple cognitive users for spectrum 
sharing are left for future investigation. 

Analogously to the collocated-receiver system model, the 
cognitive rate maximization problem can be formulated. Using 
the same argument as in Section IIII-AI we can again find 
that for sufficiently large P, the cognitive rate maximization 
problem is non-convex with the optimal cognitive PSD ap- 
propriately shaped. For general network topologies, however, 
an analytical characterization of the necessary condition for 
optimality does not appear to be immediately available. In 
the following, we therefore focus on the large-P asymptotic 
analysis of the prelog coefficient, under the assumption of an 
on-off cognitive PSD. 

Consider an on-off cognitive PSD </>x(<^) satisfying 
<fix(u) = <f>o if M € IX and <frx(w) — elsewhere, where 
U is a measurable subset of [0, ir]. Following the same idea 
of Proposition [2] we can find that as P — > oo, the prelog 
coefficient of the cognitive transmission rate is | IXj /tt, and the 
support set U should satisfy the following set of inequalities: 



1 



a k (j>l (u) 
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7r min \Dk 

k=l,...,K l 



(29) 



-dw < Di 



D 



1, 



where the MMSE without cognitive transmission D_ k is 



So the optimal on-off cognitive PSD is obtained by choosing 
U to "fill" the spectral bands in which <fis (u>) is small, until 
is satisfied. 

Part II: Coded Legacy Transmission 

VI. Basic System Model 

In this section, we introduce the basic channel model, 
which consists of a pair of legacy transceivers and a pair of 
cognitive transceivers. All the transceivers are single-antenna 
and thus the signals are all scalar in the model. See Figure 
[9] for an illustration. The legacy transmitter is static, since it 
exists before any potential cognitive transmission. We assume 
that the legacy transmitter employs a white ZMCSCG-like 
random codebook with a fixed rate Ri and a fixed transmitted 
power Og . For feasibility of cognitive transmission, we require 
that the rate Ri is achievable at least without any cognitive 
K, (27) transmission, i.e., 



Feasibility condition: log 1 + 



aiaj 



> R, 



(30) 



D, 



1 

27 



duj. 



(28) 



The optimization problem boils down to maximizing the 
volume |U| subject to the K inequality constraints in d27i >. 

In general, the prelog maximization problem as formulated 
above is not amenable to efficient optimization algorithms. 
Here we only discuss one special case, to gain some practical 
insights. 

Low-Noise Limit: In this asymptotic case, we let 0N fc (w) — > 
uniformly for all — ir < u> < 7r, and for all k = 1, . . . , K. 
The practical scenario is that all the legacy receivers are close 
to the legacy transmitter so that the effect of noise can be 
neglected. The constraints d27l ) now collapse into a single 



where ct^j is the power of the white ZMCSCG noise at the 
legacy receiver. 

The cognitive transmitter is allowed to transmit a WSS 
ZMCSCG-like randomly coded message to the channel. The 
cognitive signal, upon arriving at the legacy receiver, is treated 
as a colored noise since the cognitive codebook is not known 
by the legacy transceivers, similar to the uncoded scenario 
treated in Part I of the paper. The legacy receiver, however, is 
assumed to be able to identify the statistics of the cognitive 
signal, and to perform the corresponding optimal decoding, 
which may be decomposed into a whitening filter stage and 
a decoding stage designed for white ZMCSCG noise. Denote 
the PSD of the cognitive signal by (f>x(cu), then we have the 



following two constraints: 

i r 

Power constraint: — / cj)x{oj)dui < P (31) 
Legacy rate constraint: 



gi(/>x(v) + a^ { 



du; > Ri. (32) 



The cognitive receiver, however, does not treat the legacy 
signal as noise. This is because the cognitive service is built 
aware of the presence of legacy service, and the codebook of 
legacy transmission is broadcast in nature and thus publicly 
known at the cognitive receiver. The legacy transmitter, the 
cognitive transmitter, and the cognitive receiver hence form a 
multiple access (MAC) channel, whereas this MAC channel is 
not exactly the same as that classically treated in information 
theory (e.g., [15]), because here one of the transmitters (the 
legacy transmitter) is static and we can only design the other 
(the cognitive transmitter). 

VII. Analysis and High-Power Regime Behavior 

In this section, we analyze the operation of the cognitive 
transceivers. 

Case A: The cognitive receiver cannot decode the legacy 
signal, even without any cognitive transmission. This case 
arises if the channel gain a c between the legacy transmitter 
and the cognitive receiver is small, 



log 1 



< Ru 



(33) 



where erj^ is the power of the white ZMCSCG noise at 
the cognitive receiver. We can formulate the cognitive rate 
maximization problem under this case, as 



max 

<t>x(u) 



1 

s.t. 



log 1 



— 7T 

1 

2^ 
1 

2^ 



4>x{uo)duo < P 
log [ 1 



dio > Ri. 



Case B: The cognitive receiver can decode the legacy signal. 
This case can be further refined into two sub-cases. 

Case B-l: The cognitive receiver can utilize a successive 
decoding procedure, such that it first decodes the legacy 
signal treating the cognitive signal as noise, then decodes 
the cognitive signal after canceling the legacy signal. This 
situation is possible if 



^1 l0 " 



duj > Ri. 



(34) 
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Fig. 10. Illustration of the rate region of the MAC channel seen by the 
cognitive receiver. The horizontal segment of the rate region corresponds to 
Case B-l, and the slope segment (a.k.a. dominating face) corresponds to Case 
B-2. 



The resulting cognitive rate maximization problem becomes 



max 

x (w) 



1 



7T log 1 
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Case B-2: In this case, the cognitive receiver cannot reliably 
decode the legacy signal treating the cognitive signal as 
noise. Fortunately, we can utilize the idea of "rate-splitting" 
[16] to virtually split the cognitive signal into two parts, 
and accomplish successive decoding following the order of 
"cognitive-legacy-cognitive". Ignoring the incurred technical 
details here, we give the cognitive rate maximization problem 
formulation, as 



max 

0x(^) 



duj — Ri 



, 0x(«)dw < P 

zir 

J_ f log(l+ , L>fl, 



s.t. 
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log 1 + ■ 

\ 3 c 0xM + er N 



, dio < Ri. 



The two scenarios in Case B are illustrated in Figure 
[TOl Case B-l occurs when the legacy rate Ri is within the 
horizontal segment of the MAC rate region, so that direct 
successive coding is feasible. Case B-2 occurs when Ri is 
within the slope segment (a.k.a. the dominating face of the 
MAC rate region), and then rate-splitting is necessary. 

For both Case A and B, since we have assumed that both 
the legacy signal PSD and the noise PSD are flat, it straight- 
forwardly follows from first-order optimality arguments anal- 
ogous to that in Section IIII-BI that the optimal cognitive 



signal PSD is always an on-off PSD with the volume of the 
support spectral bands being optimized. Therefore we can 
readily obtain numerical values of the maximized cognitive 
rate under the various operational cases. We note that, in 
Case B, the cognitive transmitter can shape its signaling PSD 
4>x(w) to choose to operate either under Case B-l or Case B- 
2. Depending upon the system parameters, either of the two 
sub-cases can lead to higher cognitive rates, so the globally 
optimal scheme should be obtained by comparing the resulting 
cognitive rates of the two sub-cases. 

We are primarily interested in the regime where the cogni- 
tive transmission power budget P is large. Similar to the prelog 
coefficient analysis in Section IIII-CI for the case of uncoded 
legacy transmission, here we evaluate the prelog coefficient of 
the cognitive rate as P — ► oo. We find that, although there are 
three possible operating cases (Cases A, B-l, and B-2), the 
maximum prelog coefficient is always unique as given by the 
following proposition. 

Proposition 3: For cognitive transmission in the presence 
of coded legacy transmission, as the power budget P — > oo, 
the optimal prelog coefficient of the cognitive rate is 

* = 1 — (35) 

log (1 + aial/a^J 

Inspecting the expression of ^ in Proposition [3] we can 
make two key observations. First, the growth rate of cognitive 
transmission only depends on parameters of the legacy chan- 
nel. This observation is surprising, because one may initially 
think that the cognitive channel parameters, at least the inter- 
ference ("cross-talk") channel gains a c and gi, would affect 
the asymptotic behavior of the cognitive rate. Second, noting 
that log (l + aia^/a^ ) is nothing but the capacity of the 
legacy channel without cognitive transmission, the cognitive 
rate growth rate "J is simply one minus the normalized load 
of the legacy channel. This observation is intuitively pleasing, 
because it quantitatively reveals the basic tradeoff between 
legacy and cognitive transmissions, - the heavier (lighter) the 
legacy channel is loaded, the lower (higher) rate the cognitive 
transmission is allowed to achieve. 

Example: In this example, we plot cognitive rates in Figure 
[TTl for both Case A and Case B. The common system 
parameters are: ai = gi = cr^. = o"n = dB, er| = 30 
dB, g c = 10 dB. In Case A, we take a c = —20 dB, while in 
Case B a c = dB. The legacy rate Ri is chosen to be half 
of the legacy channel capacity, hence the prelog coefficient is 
\E r = 0.5, as indicated by the slope in the plot. It is evident 
that for both cases, the growth rates of cognitive transmission 
rate approach 'J', as P gets large. 

VIII. Extension to Cognitive Transceivers with 
Multiple Antennas 

The analysis for the basic system model can be extended 
to the case where the cognitive transceivers are equipped with 
multiple antennas, as illustrated in Figure [12] We assume that 
the legacy transceivers are still scalar. The same as in the basic 
system model, the legacy transmitter employs static and white 
ZMCSCG-like random coding with rate Ri and transmitted 
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Fig. 1 1 . Cognitive rates as a function of the power constraint P, for both 
Case A and Case B with system parameters given in the example. 
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Fig. 12. Illustration of the channel model for cognitive transmission in 
the presence of coded legacy transmission. The cognitive transceivers are 
equipped with multiple antennas while the legacy transceivers are scalar. 



power (t|, and correspondingly the legacy receiver performs 
decoding after whitening the received signal. 

To characterize the vector cognitive signal, we need to 
define its power spectrum density matrix; see, e.g., [17] for 
a comprehensive introduction. In this section, we assume that 
the cognitive signal process X„ is a WSS ZMCSCG vector 
process of dimension N t x 1, where N t is the number of 
transmit antennas. So the auto-correlation function of X„ is 



Kx(k) = E [x n+fe Xt 



k = 



-1,0,1, 



(36) 



where [■]'' denotes conjugate transpose of a complex-valued 
vector (or matrix, as in the following development). The PSD 
matrix of X„ therefore is defined as the discrete-time Fourier 
transform of Kx(k), as 



0x(w) 



J2Kx(k)e-^ k , 

k 



-7T < LO < 71". 



(37) 



The vector cognitive signal collapses into a scalar signal at 



the legacy receiver, by the JVj x 1 channel vector 
legacy receiver thus sees a channel as 



/?,. The 



(38) 



Hence the legacy rate constraint enforces the following in- 
equality, 



1 

27 



log 1 



dw > Ri. 



(39) 



The cognitive receiver sees a 7V r -dimensional (N r denoting 
the number of receive antennas) vector MAC channel as 



Ic.n = Vfk£L c X n + y/a^h c S n + N c 



(40) 



In order to evaluate the achievable cognitive rate, we need 
a general formula that relates the entropy rate and the PSD 
matrix of a WSS ZMCSCG vector process. Such a formula 
was established in [18], and can be viewed as a generalization 
of the well-known theory for the asymptotic behavior of 
ordinary Toeplitz matrices and determinants (see, e.g., [19]), 
which we have implicitly utilized throughout the development 
of the previous sections. Specialized to our problem, the basic 
result is that the entropy rate of a WSS ZMCSCG vector 
process is equal to l/(27r) J"" logdet <j>((j)duj, where 4>(lo) 
is the PSD matrix of the process. 

Having introduced the above technical preparation, analysis 
of the operation of the cognitive transceivers closely follows 
that in Section IVHI for scalar cognitive transceivers. Here 
we are again interested in the regime where the cognitive 
transmission power budget P is large, and seek to identify the 
prelog coefficient of the cognitive rate. Note that the power 
constraint for vector cognitive transmission becomes 



1 

2^ 



trace[0x(w)]dcj < P. 



(41) 



Analogously to Proposition [3] for vector cognitive trans- 
mission, the prelog coefficient is given by the following 
proposition. 

Proposition 4: For cognitive transmission in the presence 
of coded legacy transmission, and with cognitive transceivers 
equipped with multiple antennas, as the power budget P — > oo, 
the optimal prelog coefficient of the cognitive rate is 



* = 



1 



Ri 



■ rank [H c ] . (42) 
'reposition |U we notice 



log^ + a/crf/o-^J 
Inspecting the expression of "J in 
that it is the product of two terms. The first term is nothing 
but ^ in Proposition |3] for scalar cognitive transmission, and 
thus the two key observations made therein apply. For the 
second term of the rank of the cognitive channel matrix H c . 
we can make two additional observations. First, since H_ c is 
full-rank with high probability in a propagation environment 
of rich scattering, employing multiple antennas at cognitive 
transceivers is beneficial in that it scales the growth rate of the 
cognitive transmission, thus offsetting the loss due to loading 
the legacy channel. Second, ^> is independent of the spatial 
beam-forming used by the cognitive transmitter. In fact, as 
can be seen from the proof of Proposition [4] the growth rate 
of can be always achieved, as long as we choose the "on" 
PSD matrix 4> Q to satisfy T&nk[H c <t> Ht] = rank[ff e ], - easily 



fulfilled by letting 



lN t xN t , for example. However, we 



note that appropriately shaping (f> Q may lead to additional rate 
gain in the regime of finite cognitive power budget, and this 
is a potential research topic for future investigation. 

Concluding Remarks 
Remarks on System Models 

A few remarks regarding the assumptions of the system 
models used in the paper follow here. 

In both cases (uncoded and coded legacy transmission) 
studied in the paper, the channel models are discrete-time. 
There are some delicate issues in such a modeling strategy, 
related to sampling and time synchronization. For the case of 
uncoded legacy transmission, the baseband legacy signal for 
transmission is generally not pulse-shaped and can be viewed 
as a continuous-time WSS ZMCSCG process. Therefore, to 
obtain the discrete-time channel model at the (legacy and cog- 
nitive) receivers, we simply need to sample the received signal 
in synchronization with the timing epochs of the digitally 
coded and pulse-shaped cognitive signal. Here, an implicit 
requirement is that the pulse width of the cognitive signal 
exactly coincides with the required sampling period of the 
legacy signal for estimation; in other words, the signaling 
bandwidth of the cognitive transmission is set at least equal 
to the bandwidth of the legacy signal, - and the sampling 
period should always be no less than the signal bandwidth for 
alias-free estimation. 

For the case of coded legacy transmission, both the legacy 
signal and the cognitive signal are pulse-shaped. The discrete- 
time channel models therefore imply that the two sets of 
transceivers are time-synchronized at the symbol level. For 
cognitive transceivers, synchronization may be achieved by 
locking into the timing epochs of the legacy signal, after an 
initial period of tracking, which is possible because after all 
we have assumed that the cognitive transmission system has 
knowledge of the legacy codebook and signaling schemes. If 
the cognitive transceivers do not make any effort in keeping 
synchronization with the legacy transceivers, then both (legacy 
and cognitive) receivers experience inter-symbol interference 
(ISI), and our analysis should be modified accordingly to 
account for the effect of ISI, - a research topic for future 
investigation. 

Our treatment of the case of uncoded legacy transmission 
only applies when the legacy signal is linearly modulated 
such that the message to be estimated is proportional to the 
actual signal transmitted. Examples of such systems include 
amplitude-modulation (AM) broadcast radio, analog television 
(NTSC, for example), and so on; but do not include non- 
linearly modulated systems like frequency-modulation (FM) 
broadcast radio or wireless microphone (typically using FM). 
Analysis for those systems can be technically challenging due 
to the lack of convenient tools for quantifying the achievable 
MSE, especially in the presence of correlated cognitive signal. 

Implementation Issues 

The envisioned implementation of the ideas studied in the 
paper can be an overlay cognitive system built in the presence 



of, instead of in the absence of, legacy transmission. As 
revealed by our analysis, the most benefits are obtained when 
the gap between the target and the achievable communication 
quality (in terms of estimation MSE or information rates) of 
the legacy service is large, i.e., when the legacy channel is 
lightly loaded. With spectrum shaping, we can achieve loga- 
rithmic growth of cognitive transmission rates, even without 
knowledge of the legacy message at the cognitive transmitter. 
Certain modifications need to be implemented at the legacy 
receiver, in order to adapt to the statistics induced by the 
cognitive spectrum shaping. However, we note that such modi- 
fications do not demand excess increase in receiver complexity, 
and may be implemented for a dynamic environment through 
adaptive MMSE filtering or noise whitening. 

Some more specific discussion about the implementation 
issues follows here. 

Legacy Reception with Finite Buffering: In the analysis 
of the case of uncoded legacy transmission, the optimal 
estimator at the legacy receiver is a non-causal Wiener- 
Kolmogorov filter, which, in principle, requires observations 
from the infinite future for processing. In implementation, 
we approximate the non-causal Wiener-Kolmogorov filtering 
operation by introducing a large, yet finite, block of delay. 
An interesting question then is how large the delay should be 
chosen, such that the estimation performance is satisfactory 
while the required buffer size and latency is tolerable. If we 
consider the extreme case, where we do not introduce any 
delay such that the legacy estimator is a pure predictor, then 
a related problem is how to optimize the cognitive rate by 
spectrum shaping. For causal estimation, the MSE formula is 
highly complex (see, e.g., [14] [17]), so this problem is non- 
trivial. An interesting question there is whether the cognitive 
transmission rate still logarithmically grows with the power 
budget. 

Multicarrier Implementation: Spectrum shaping implies 
that the coded symbols of cognitive signal are correlated as 
indicated by the PSD. A direct implementation for introducing 
the correlation is transmit side pre-coding. On the other 
hand, due to the increasing popularity of multicarrier systems 
like orthogonal frequency-division multiplexing (OFDM), it is 
useful to examine how spectrum shaping can be implemented 
through multicarrier modulation. In fact, since spectrum shap- 
ing manipulates the signal power across spectral bands, it is 
particularly suitable for multicarrier implementation. In doing 
so, the cognitive transmitter simply divides its whole band- 
width into multiple narrow bins, ensuring that the specified 
cognitive signal PSD does not rapidly change within each 
narrow bin. There is one caution to be exercised for coded 
legacy transceivers if they also employ multicarrier modula- 
tion. That is, the legacy message should be coded in a way 
such that the resulting signal is robust against the effect that 
some subcarriers are essentially "erased" due to the existence 
of "on" cognitive PSD in those subcarriers. Alternatively, the 
cognitive transceivers may spread out their interference to the 
legacy receiver, by pseudo-randomly changing the cognitive 
PSD such that the interfered subcarriers vary with time and 



no single subcarrier suffers from constant interference. 
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